Initial KL penalty coefficient (used for adaptive and linear control).